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SUMMARY 

Lipases (Lip) hydrolyze triglycerides into fatty acids and glycerol. Lip produced by the yeast Candida cylindracea are 
encoded by multiple genomic sequences. We report the molecular cloning and characterization of three genes from this 
family. They encode putative mature 57-kDa proteins of 534 amino acids (aa). To date, five Lip-encoding genomic 
sequences from C. cylindracea have been characterized in our laboratory. The five deduced aa sequences share an overall 
homology of 80%. These sequences have been aligned with each other and with those of homologous enzymes, the Lip 
from the mould Geotrichum candidum and the acetylcholinesterase from Torpedo californica, whose three-dimensional 
structures have been solved by X-ray analysis. The C. cylindracea Lip appear to have a structural organization similar 
to that described for both enzymes. 



INTRODUCTION 

Lipases (Lip; triacylglycerol acyl hydrolases) are key 
enzymes in fat metabolism that catalyze the breakdown 
of triacylglycerols to free fatty acids and glycerol. They 
constitute an ubiquitous group of enzymes able to cata- 
lyze a number of different reactions, many of them of 
industrial interest (reviewed in Harwood, 1989). Lip hy- 
drolyze ester bonds of water-insoluble substrates at the 
interface between the organic phase containing the emul- 
sified substrate and the aqueous phase in which the 
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enzyme is soluble. Lip activity is largely increased at the 
lipid-water interface (Brockman, 1984). 

The biochemical and molecular characterization of a 
number of Lip obtained from different sources has 
pointed to a marked heterogeneity of this class of en- 
zymes with regard to specificity, aa sequence and catalytic 
properties (Antonian, 1988). Based initially on the inhibi- 
tion of enzymatic activity by chemical modification, Lip 
have been classified in the Ser hydrolases group (Chapus 
et al., 1976). The serine of the active site has been shown 
to be enclosed in a highly conserved Gly-Xaa-Ser- 
Xaa-Gly motif. This consensus domain represents the 
only feature shared by all determined lipase sequences 
(Antonian, 1988). 

Recently, a very significant contribution to the under- 
standing of the Lip structure and function relationships 
arose from the elucidation of the three-dimensional struc- 
ture of Lip from the human pancreas (Winkler et al., 
1990), Rhizomucor mxehei (Brady et al., 1990) and Geo- 
trichum candidum (Gel; Schrag et al., 1991). The three 
enzymes differ in their sequences (Boel et al., 1988; 
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Shimada et al, 1989; 1990; Winkler et al., 1990) and in 
several structural features. Nevertheless, significant sim- 
ilarities have been observed in the topology of the mixed 
P-sheet conserved in the protein hydrophobic core and 
in the configuration of the Ser-His-Asp/Glu catalytic 
triad, functionally equivalent to that described for Ser 
proteases. The active site is buried in a deep hydrophobic 
cavity covered by amphipathic helical elements named 
by the authors 'lid' or 'flap'. Interfacial activation appears 
to imply a conformational change of the *lid\ making the 
active site accessible to the substrate. This suggested 
mechanism finds support in the x-ray studies performed 
on a complex of Rhizomucor lipase with a substrate ana- 
logue, where the flap appeared rotated by about 167° 
(Brzozowski et al., 1991). 

The extracellular Lip produced by the asporogenic 
yeast C. cylindracea (Ccl) hydrolyzes triglycerides with- 
out specificity both in the attacked position of the gly- 
cerol molecule and in the nature of the fatty acid released. 
This enzyme has therefore a relaxed specificity in compar- 
ison with other Lip. This property makes it useful for 
industrial applications. In previous reports from our 
laboratory, it has been shown that multiple Lip-related 
sequences are present in the C. cylindracea genome 
( Alberghina et al, 1 99 1 ), and the sequences of two of them 
(LIP] and LJP2) have been determined (Longhi et ah, 
1992). In this paper we report the cloning and sequencing 
of three other Lip genes and an analysis of the sequences 
of the C. cylindracea LIP multigene family, compared 
with each other and with those of related enzymes. On 
the basis of these results, we propose a model for the 
structural organization of the C. cylindracea Lip. 

RESULTS AND DISCUSSION 

(a) Cloning of three genomic UP sequences 

C. cylindracea genomic DNA was digested to comple- 
tion with restriction endonucleases and subjected to 
Southern blot analysis using fragments derived from 
LI PI (Longhi et al., 1992) as probes (fig. 1A). Duplicate 
blots were hybridized to probes specific for either the 5'- 
end or the 3'-end of the gene. Multiple hybridization sig- 
nals were observed (Fig. IB). In the pattern obtained with 
Sad-cleaved DNA, four bands were positive to both 
probes (not shown). The two Sacl bands of about 4.2 and 
5.1 kb (Fig. IB) are likely to correspond to the two geno- 
mic clones previously isolated in our laboratory (Longhi 
et al., 1992). Moreover, the complexity of the hybridiza- 
tion pattern indicated the presence of other closely related 
homologous sequences. Therefore, total yeast DNA was 
cleaved with Sacl, size selected for fragment lengths 
> 7 kb and used to construct another genomic library. 



Screening of ca. 7000 recombinants led to the isolation 
of three positive clones carrying 10 kb Sacl-Sacl inserts. 
One of them was further characterized by a fine hybrid- 
ization analysis using five probes representative of the 
whole LI PI sequence, as indicated in Fig. 1 A. This clone 
was found to contain three sequences apparently oriented 
head to tail, hybridizing to all five probes and showing 
distinctive restriction patterns (Fig. 1C). 

(b) Nucleotide sequence of LIP and deduced aa sequence 

L/P-homologous sequences (L/P5, LIP4 and LIPS) 
were separately subcloned into pGEM3Z (Promega, 
Madison, WI) and their complete nt sequences were 
determined on both strands (Fig. 2). The nt sequences 
obtained were translated into the corresponding aa se- 
quences taking into account the unusual usage of the 
codon CTG — Ser instead of Leu — occurring in this Can- 
dida strain (Kawaguchi et al., 1989). 

The LIP3 sequence contains a unique ORF of 1647 nt 
terminating with a TAG stop codon (Fig. 2A). The de- 
duced aa sequence corresponds to a protein of 549 aa 
with an N-terminal stretch of 15 hydrophobic aa, which 
might encode a signal peptide. The cleavage of the pre- 
protein by the signal peptidase is predicted to occur after 
the tripeptide ValAlaAla (von Heijne, 1986). LIP3 would 
therefore produce a mature protein of 534 aa and M T 
57 291. Three potential yV-glycosylation sites (Asn-Xaa- 
Ser/Thr; Hubbard and Ivatt, 1981) have been identified 
at aa positions 291, 314 and 351. Five Cys residues are 
present at aa positions 60, 97, 217, 268 and 277. The 
predicted pi (isoelectric point) of the Lip3 protein is 5.1. 

The LIP4 sequence spans an ORF of 1647 nt closed 
by a TAG codon, corresponding to a preprotein of 549 
aa (Fig. 2B). Upon cleavage of the signal peptide this 
would result in a mature protein of 534 aa and M T 57 051. 
The putative Lip4 contains a unique site for N-glycosyla- 
tion (Asn 351 ), and six Cys at positions 60, 97, 217, 268, 
277 and 328. Its calculated pi is 5.7. 

The LIPS ORF is 1647 nt long and is terminated by a 
TAG stop codon, corresponding to a mature protein of 
534 aa and M r 56.957, preceded by a leader peptide of 
15 aa. Lip5 contains two putative N-glycosylation sites 
(Asn 314 and Asn 351 ) and five Cys at position 60, 97, 217, 
268 and 277. Its calculated pi is 5.5. 

(c) Comparison of the deduced aa sequences 

LIP3, LIP4 and LIPS encode for putative proteins of 
534 aa highly related in sequence to each other and to 
the Lipl and Lip2 from the same organism (Longhi et al., 
1992). All five LIP sequences will be considered in the 
following analyses. Residues conserved in all five se- 
quences make up 66% if identity is considered and 84% 
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Fig. L C. cylindracea genome analysis and cloning of LIP. (A) Location of the probes (ca. 200 bp) in LIP I. DNA fragments were purified from 
agarose gels and labeled by random priming with [ 32 P]dCTP, Probes are marked by different symbols. (Panel B) Southern blots of genomic DNA 
digested with restriction endonucleases and hybridized to the Sac]- Aval probe specific for the 5'-end of LIP. A similar hybridization experiment was 
performed with the Pstl-EcoRV probe specific for the 3'-end (not shown). Sad bands positive to both probes are indicated by open triangles. 
Hybridization was carried out at 45*0. (C) Location of LIP homologous sequences within the 10-kb Sacl-Sacl fragment as deduced by Southern 
analysis with the probes described in A. Hybridization conditions were as above. DNA regions positive to the same probe are indicated by the same 
symbol. Only restriction sites used for this analysis are reported. Coding sequences and their orientation are represented by arrows. 



if aa are grouped on the basis of their physicochemical 
similarity as specified in the SIMPLIFY program of the 
GCG sequence analysis package (Devereux et al., 1984). 
The Lip consensus sequence Gly-Xaa-Ser-Xaa-Gly is 
conserved in all sequences (Fig. 2). A common feature of 
the putative Lip is the presence of hydrophobic signal 
peptides, whose cleavage would generate mature proteins 
with Ala as the first aa. These observations are in 
agreement with the results obtained by sequencing the N 
terminus of the mature protein (Kawaguchi et al., 1989). 



The M r predicted for the Lip are in agreement with those 
experimentally estimated by gel electrophoresis of puri- 
fied Ccl commercial preparations (Veeraragavan and 
Gibbs, 1989). Potential sites of glycosylation, one of 
which (Asn 351 ) is conserved in all five proteins, have been 
identified. Biochemical analyses have shown that Ccl con- 
tains 4% sugars, mainly mannose and xylose (Tomizuka 
et al., 1966). Four Cys residues are conserved, which 
might participate in the formation of disulfide bridges. 
Analysis of 5' regions showed the presence of TATAA 
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AAGGC AC C AAGGC GGT 

CAAGACACAAAGAGCGT GGGGAGGTGAC_AAT_GCC GATCGCGGACCAGTTTACACCGCTCC GAGT_AJ_AAAAGC AGAAGC ATTCTCACCTGCTCGCTCCCC 
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ACG TTG GTG GCT CTG AC G GGC TGC AGC ACT GCC AGC A AC A AG CTT GCG TGC TTG CGT GGT CTT CTG ACT C AG GCA 

Thr Leu Val Ala Ser Thr Gly Cys Ser Ser Ala Ser Asn Lys Leu Ala Cys Leu Arg Gly Leu Ser Thr Gin Ala 

TTG CTC GAT GCC ACC AAC G AC ACC CCT GGG TTC TTG TCG T AC ACC TCG TTG CGG TTG CTG T AC CTT CCC CGG CCC 

Leu Leu Asp Ala Thr Asn Asp Thr Pro Gly Phe Leu Ser Tyr Thr Ser Leu Arg Leu Ser Tyr Leu Pro Arg Pro 

GAC GGC GCC | AAC AT C ACC[ GAT G AC AT G T AC A AG TTG GTA CGC GAC GGC A AG TAT GCA AGC GTT CCC GTG ATC ATT 

Asp Asp Met Tyr Lys Leu Val Arg Asp Gly Lys Tyr Ala Ser Val Pro Val lie lie 



GGC GCC 
Asp Gly Ala 



AAC ATC ACC 
Asn He Thr 



GGC GAC CAG AAC GAC GAG GGC 
Gly Asp Gin Asn Asp Glu Gly 

TAC CTC AGA AAG CTG TTC ATC 
Tyr Leu Arg Lys Ser Phe I le 



TTC 
Phe 



CAC 
His 



TTG TTT GGT CTC CTG CTG TTG 
Leu Phe Gly Leu Ser Ser Leu 



AAC ACC ACC 
Asn Thr Thr 



ACC GAG GCC GAC GCC GAG GCA 
Thr Glu Ala Asp Ala Glu Ala 



ACC CAG GGT CTG CCG TTC GAC ACG 
Thr Gin Gly Ser Pro Phe Asp Thr 



GCC 
Ala 

GGC 

Gly 



ACC GAC GCC GAT ATC ACC GCA TTG AAG GCG GCG TAC CCC AGC GAT GTC 
Thr Asp Ala Asp He Thr Ala Leu Lys Ala Ala Tyr Pro Ser Asp Val 



ATT CTC AAC GCC CTT AC A CCC 
lie Leu Asn Ala Leu Thr Pro 



CAG CTC 
Gin Leu 



AAG CGG ATC AAT GCT GTG CTT 
Lys Arg lie Asn Ala Val Leu 



GGC GAC CTC ACC TTT ACC CTC TCG CGC CGC TAC TTC CTC AAC CAC TAC ACC GGT GGT CCC AAG TAC TCG TTC CTC 



900 
285 



975 
310 



1 050 
335 



1 125 
360 



1200 
385 



1275 
410 



1350 
435 



Gly Asp Leu Thr Phe Thr Leu Ser Arg Arg Tyr Phe Leu Asn His Tyr Thr Gly Gly Pro Lys Tyr Ser Phe Leu 

CTG AAG CAG CTT CTG GGC TTG CCC ATT CTC GGC ACG TTC CAC GCG AAC GAC ATT GTG TGG CAG CAC TTT TTG TTG 
Ser Lys Gin Leu Ser Gly Leu Pro Me Leu Gly Thr Phe His Ala Asn Asp lie Va L Trp Gin His Phe Leu Leu 

GGC AGC GGC AGC GTC ATC TAC AAC AAC GCG TTC ATC GCG TTT GCC ACC GAC TTG GAC CCC AAC ACC GCG GGC TTG 
Gly Ser Gly Ser Val lie Tyr Asn Asn Ala Phe He Ala Phe Ala Thr Asp Leu Asp Pro Asn Thr Ala Gly Leu 

CTG GTG CAG TGG CCC AAG TCC ACC AGC AGC CTG CAG GCG GGG GAC AAC TTG ATG CAG ATC AGT GCC TTG GGC TTG 
Ser Val Gin Trp Pro Lys Ser Thr Ser Ser Ser Gin Ala Gly Asp Asn Leu Met Gin lie Ser Ala Leu Gly Leu 

TAC ACC GGC AAG GAC AAC TTC CGC ACC GCC GGC TAC AAC GCT TTG TTT GCC GAC CCG CTG CAC TTT TTC GTG TAG 
Tyr Thr Gly Lys Asp Asn Phe Arg Thr Ala Gly Tyr Asn Ala Leu Phe Ala Asp Pro Ser His Phe Phe Val 

ACAATGTACATGTGCACCCCCAAGAAGCCTGCTTCGTCC ACCAGCCC 
Fig. 2. The nt sequences of UP3 (A), LIP4 (B) and UPS (C) and deduced aa sequences. The nt are numbered starting from the start codon as 
position A= +1. The aa are numbered taking the first aa of the mature proteins as position +1 (to the right of black arrowhead). TATA A and 
CAAT-like sequences are underlined. The black arrowhead indicates predicted signal peptidase cleavage sites. Lip consensus sequences are marked 
by asterisks, and potential /V-glycosylation sites are boxed. CTG-Ser are in bold-face type. The nt sequences were determined by the dideoxy chain- 
termination method (Sanger et a!., 1977). Sequences have been submitted to the EMBL Databank under accession Nos X66006 {LIP3) X66007 
(LIP4) and X66008 {LIPS). 



U25 
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1500 
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and CAAT boxes enclosed in conserved sequence regions 
(Fig. 3). CAAT and TATAA boxes are implicated in the 
initiation of transcription in eukaryotes and have been 
found upstream from genes highly expressed in Candida 
species (Singer et al., 1989). Further evidence of Lip ex- 
pression was inferred from the comparison with the re- 
striction maps of Ccl cDNAs isolated by other authors 
(Kawaguchi and Honda, 1991). 

The predicted pis (4.5 for Lipl, 4.9 for Lip2, 5.1 for 
Lip3, 5.7 for Lip4 and 5.5 for Lip5) would support a 



LIP2 
LIP3 
LIP4 
LIPS 



LIP2 
LIP3 
LIP4 
LIPS 



-160 -150 
AGGCTTGTEfcGCApGh'CACTGCTq 
.... GGATpffcGTGpGfTCTGCCCGC 
GJ3ATGCCGTA 



-130 -120 
GGCCAGSftTGTGGpCtc 
CGCTGGCpCCCCGCcKT 

GCAGcddcccTTifccbc 



-110 
CACGCCCCGJTI 
TGCCCGG, 
ATGTTGC, 
-AAGGCACfc^AGi 




-90 -80 
iGTCTTTACTGCATTGGAGCCA 
■CCTfifeCGGTfFbACAA CAAT GicGG 
GACmCAAOTRGCTGATCGG 
GACACAAB^AGCGfapGGGAGGTGA 





-30 



LIP2 
LIP3 
LIP4 
LIP5 



-10 - 1 

TCtCA-CTBq^TCT l ffl l , AT( J 

TCTCACCTGCTI " 
TCGC@:cgGCT< 
TCTCACCTGCTt 



Fig. 3. Sequences upstream from LIP2. 3, 4, and 5. TATAA and CAAT 
boxes are underlined. The nt conserved in at least three sequences arc 
boxed. Last digits of numerals are aligned with corresponding nt. 



correlation among the putative proteins encoded by the 
genes isolated in our laboratory and the Lip isoforms 
detected in the enzyme purified preparations by different 
experimental approaches, such as two-dimensional gel 
electrophoresis (Veeraragavan et al., 1989) and affinity 
chromatography (E. Cernia, G. Ortaggi, M. Castagnola 
and R. Rubino, personal communication). Although gly- 
cosylation or other post-translational modifications could 
affect the pi predicted for the protein moiety to some 
extent, it is likely that the information provided by the 
sequence analysis could be useful for the separation and 
identification of C. cylindracea Lip isoforms. 

In yeast, genes belonging to multigene families have 
been reported that are expressed under different growth 
conditions or produce enzymes differing in their substrate 
preference (Pretorius et aL, 1991). Interestingly, a strain 
of G. candidum synthesizes Lip with a different substrate 
specificity (Sidebottom et al., 1991). The production of 
Lip isozymes could therefore be of physiological rele- 
vance in conferring to the yeast versatility in adaptation 
to different environments. 



(d) Multiple alignment with homologous sequences 

Screening of data bases by the FASTA program 
showed that the Lip from C. cylindracea are very weakly 
related to most other Lip, with the only exception being 
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the two Lip i so forms from G. candidum (40-44% identity 
over 544 aa aligned). Moreover, significant homology 
was observed to proteins belonging to the ChE family. 
The highest sequence similarities were found with AChE 
from T. californica (TAChE) and T. marmorata (27-31% 
identity over 446 aa aligned). The three-dimensional 
structures of both Gel and TAChE have been recently 
solved by x-ray diffraction (Schrag et aL, 1991; Sussman 
et aL, 1991). 

In order to gain some structural information concern- 
ing Ccl, sequence analysis of the Lip has been performed. 
This analysis can take advantage of two conditions: (i) 
multiple sequences belonging to the same gene family are 
available, (n) structures of two different proteins, both 
homologous to Ccl, have been described. Fig. 4 shows 
the multiple alignments of the aa sequences encoded by 
LIP (Lip 1-5), the sequences reported for two G. candidum 
isoforms (Shimada et aL, 1989; 1990) and best scoring 
esterase sequences in the FASTA output. Sequences of 
the mature proteins have been considered. 

(e) Secondary structure elements 

Sequence homology, when available, is at present the 
most reliable tool for secondary structure prediction. The 
identity relating the Ccls to Gels and TAChE is greatly 
above the threshold of significance (25% identity for more 
than 80 residues aligned) introduced by Sander and 
Schneider (Sander and Schneider, 1991). In this case, the 
reliability of the method employed for the alignment can 
be assessed, since two sequences of known structure have 
been enclosed. As shown in Fig. 4, corresponding do- 
mains in the secondary structures of Gels and TAChE 
are indeed aligned. The good superimposition of the two 
reference sequences with those of Lip suggests that the 
Ccl might have a similar structural organization. 

(f ) The helical lid 

Despite the high overall similarity, local sequence di- 
vergences are observed in Lip in comparison with the 
Gel. In particular, the sequences of the first, second and 
eighth helices of Gel cannot be satisfactorily aligned to 
the corresponding regions of the Lip sequences, even if 
gaps are introduced (Fig. 4). The lack of significant 
homology in these regions suggests that they might have 
a different structural organization in Lip. It should be 
noted, however, that several statistical methods for 
secondary structure prediction all consistently predict a 
helical conformation for the region 63-94 of the Lip 
sequences (data not shown). These helices have been de- 
scribed to build up the Mid' covering the active site of the 
Gel and proposed to undergo conformational changes 
during the process of interfacial activation of this enzyme 
(Schrag et aL, 1991). It is intriguing that the high and 



continuous sequence homology between the two fungal 
Lip is abruptly interrupted in these regions. It could be 
hypothesized that such local divergences form the basis 
of the different substrate specificity of the two enzymes. 
Corresponding aa stretches are not conserved in the 
AChE sequences. In these cases, the lack of a structure 
analogous to the lid has been demonstrated by crystallo- 
graphy. Based only on the alignment of Fig. 4 and in the 
absence of direct structural information, it is not possible 
to hypothesize whether other structural elements might 
provide the Mid 1 function to Cel. It is also possible that 
the regions corresponding to the Mid' elements can protect 
the active site or that conformational changes different 
from those hypothesized for the other Lip have an ana- 
logous functional role for the mechanism of interfacial 
activation in Candida Lip. 

(g) Structural organization 

Key residues stabilizing the folding of Gel and TAChE 
are conserved in Ccl (Fig. 4). Charged residues forming 
salt bridges in Gel (Arg 38 /Glu 103 and Glu 180 /Arg 290 ) and 
in TAChE (Arg^/Glu 92 and Glu 163 /Arg 267 ) are con- 
served at equivalent positions in the Ccl family. This 
would suggest a role for the corresponding pairs 
Arg 37 /Glu 95 and Glu 172 /Arg 279 in the stabilization of the 
tertiary structure of the Ccl (Fig. 4). Cys residues forming 
two disulfide bridges in Gel (Cys 61 /Cys 105 and 
Cys 276 /Cys 288 ) and two out of the three disulfide bridges 
in TAChE (Cys 67 /Cys 104 and Cys 255 /Cys 265 ) are con- 
served also in the Lip family. The only exception concerns 
Cys 268 of Ccl, Cys 254 of TAChE and Cys 276 of Gel that 
are shifted by a few residues with respect to each other 
and to their partners in the disulfide bridge (Cys 277 , 
Cys 265 and Cys 288 , respectively). It is therefore likely that 
the frame of disulfide bridges is maintained in the three 
enzymes, where in Ccl the four Cys we hypothesize to be 
connected are Cys 60 /Cys 97 and Cys 268 /Cys 277 (Fig. 4). 
The third disulfide bridge of AChE is not present in 
fungal Lip. The strong conservation of residues involved 
in protein structure stabilization is suggestive of a general 
similarity of the three-dimensional structures of the three 
enzymes. 

(h) The catalytic triad 

Crystallographic data on the Gels and TAChE pointed 
out the presence of a trypsin-like catalytic triad in their 
active site. Residues composing the catalytic triad of Gel 
(Ser 217 , Glu 354 and His 463 ; Schrag et al. 1991) and of 
TAChE (Ser 200 , Glu 327 and His 400 ; Sussmann et aL, 1991) 
are conserved in the Lip sequences (Fig. 4). Moreover, 
these residues are surrounded by highly conserved re- 
gions. Our results strongly support the hypothesis 
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Fig 4. Multiple alignment (Devereux et al., 1984) of the putative Lip {Lipl -5), two Lip iboforms from C candidum (Gcll and Gcl2), ChE from Homo 
sapiens (ChleSHuman) and Rattus norvegicus (ChleSRabit) and AChE from T. marmoroia (AcesSTorma), mouse (AcesSMouse) and T. californica 
(AcesSTorca). The aa are numbered starting from the first aa of the mature proteins. Identical aa conserved among more than 50% of the sequences 
are boxed. Symbols on the top and the bottom of each sequence block refer to Gcll and AcesSTorca, respectively. Active site residues are indicated 
by asterisks, secondary structure elements by open bars, residues involved in salt bridges by diamonds, where the same filling pattern identifies partner 
residues in the bridge. Paired Cys are indicated by upward-pointing carets, downward-pointing carets, and open squares (the last is present only 
m AChE). 
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(Schrag et ah, 1991) that Ser 209 , Glu 341 and His 449 are 
part of the active site of Cel. 

(i) Evolutionary analysis 

The active Ser residue of the superfamily of serine hy- 
drolases is embedded in the consensus motif GlyXaa^er- 
Xaa 2 Gly, where in Lip Xaa x is usually either Tyr or His. 
The consensus sequence within the AChE family is re- 
ported to be PheGlyGluSerAlaGly (Bairoch, 1990). Inter- 
estingly, the catalytic Ser of the two fungal Lip is 
surrounded by a consensus sequence characteristic of 
AChE. This motif is highly conserved in the Lip se- 
quences, the only exception being Lip2 in which a Tyr 
substitutes for Phe. This observation - if substantiated 
by the isolation of an active product of LIP2 - would 
suggest either that in the first position of the consensus 
sequence any aromatic side chain might be allowed or it 
might correlate with differences in catalytic function or 
in substrate specificity. The carboxylic chain of the cata- 
lytic triad appears to be provided by glutamate instead 
of aspartate, as in other Ser hydrolases. AChE (Sussman 
et al., 1991), the Gel (Schrag et al., 1991) and the Ccl are 
the only examples reported to date of Ser hydrolases 
using a Ser-His-Glu catalytic triad. Recently, Krejci and 
collegues (1991) proposed grouping in the same protein 
family proteins containing a common ChE-like domain. 
This family comprises several Ser hydrolases, including 
esterases, AChE and Gel. On the basis of the sequences 
determined for LIP, Ccl would also belong to this family. 

These observations, taken together with the overall 
sequence homology among Gels, Ccls and AChE, sug- 
gest a common ancestor gene for these enzymes. The 
two fungal Lip could therefore be considered as a bridge 
between the esterases and the Lip family (Fig. 5). 

( j) The genetic code 

A unique feature of the C. cylindracea strain 
ATCC14830 from which we have cloned LIP is the use 
of a non-universal genetic code. By comparing nt and aa 
sequences, it has been demonstrated that in this strain 
the codon CTG codes for Ser instead of Leu (Kawaguchi 
et al., 1989). Interestingly, CTG is used with a high fre- 
quency (3% of the codons) in LIP, including that corre- 
sponding to the catalytic Ser. Out of the 24 CTG-Ser, 13 
are conserved in the Lip family. At non-conserved posi- 
tions, CTG-Ser were found to be substituted by Ser (other 
codons). Leu, Ala, Pro, Trp, Asn or His. How such a 
phenomenon could originate and be maintained through- 
out evolution remains to be clarified. 

(k) Conclusions 

(/) In the yeast C. cylindracea are present several Lip 
genomic sequences (at least five but possibly even more). 



I Chle$Human 



1 Chle$Rabit 



i— Aces$Torca 



L Aces$Torma 



Aces$Mouse 
Gcll 



Gcl2 
Lip2 
Lip4 



I Lipl 

' Lip3 

1 Lip5 

Fig. 5. Tree representation (Devereux et al., 1984) of the clustering rela- 
tionships among the sequences aligned (and explained) in Fig. 4. 

(2) The availability of multiple genes belonging to the 
same family provides an useful tool for the identification 
of structurally significant sequence features. The more 
interesting observation emerging from the comparison of 
the Lip sequences is the conservation of the consensus 
sequence Phe/Tyr-Gly-Glu-Ser-AIa-Gly surrounding the 
active Ser 209 , supporting a close relation of the C. cylin- 
dracea enzymes with the ChE family. 

(3) By the use of sequence alignments we have inferred 
a tentative model of the structural organization of Ccl 
based mainly on the crystallographic data reported for 
the homologous Gel. The overall structures of the two 
fungal Lip are likely to be very similar, except for the 
helical chains building up the 'lid' in Gel. 
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